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Abstract. The distribution of the sum of independent identically distributed uniform 

random variables is well-known. However, it is sometimes necessary to analyze data which 

have been drawn from different uniform distributions. By inverting the characteristic 

function, we derive explicit formulae for the distribution of the sum of n non-identically 

distributed uniform random variables in both the continuous and the discrete case. The 

results, though involved, have a certain elegance. As examples, we derive from our general 

formulae some special cases which have appeared in the literature. 
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I. Introduction 

The classical uniform distribution is perhaps the most versatile statistical model: appli- 
cations abound in nonparametric statistics and Bayes procedures. Chu (1957) and Leone 
(1961) utilized uniform distributions in connection with sample quasi-ranges. Naus (1966) 
applied uniform distribution in a power comparison of tests of random clustering. For 
additional applications and examples, see Johnson et al (1995). 

Here, we are concerned with the distribution of the sum of n independent non-identically 
distributed uniform random variables. It is well-known that the probability density func- 
tion of such a sum, in which the summands are uniformly distributed in a common 
interval [—a, a], can be obtained via standard convolution formulae: see Feller (1966, p. 
27) or Renyi (1970, pp. 196-197), for example. However, it is sometimes necessary to an- 
alyze data which have been drawn from non-identical uniform distributions. For example, 
measurements accurate to the nearest foot may be combined with measurements accurate 
to the nearest inch. In such cases, the distribution of the sum is more complicated. Tach 
(1958) gives tables to five decimal places of the cumulative distribution of the sum for 
n = 2, 3 and 4 for some special cases. 

The first general result in this direction seems to have been made by Olds (1952), who 
derived the distribution of the sum Y^j=i Xj, in which each Xj is uniformly distributed in 
an interval of the form [0, a,j) with aj > 0. The proof is by induction, and in that respect 
is somewhat unsatisfactory, since in general inductive proofs require knowing beforehand 



SUM OF NON-IDENTICAL UNIFORM RANDOM VARIABLES 3 

the formula to be proved. Subsequently, Roach (1963) deduced what is essentially Olds' 
formula using n-dimensional geometry. Later Mitra (1971), apparently unaware of these 
previous results, derived the distribution of the sum in which each random variable is 
uniformly distributed in an interval of the form using Norlund's (1924) difference 

calculus. 

Here, we derive an explicit formula for the slightly more general situation of the distri- 
bution of the sum Y2™ =1 Xj, in which each Xj is uniformly distributed in an interval of 
the form [cj — aj, Cj + dj] with aj > and Cj an arbitrary real number. Of course, each of 
the aforementioned results can be obtained from ours by specializing the parameters Cj 
and dj accordingly. 

Our approach is via Fourier theory and is quite straightforward; specifically we invert 
the characteristic function. As a result, our formula differs somewhat in form from the 
special cases alluded to previously. However, the inversion technique is quite flexible, and 
readily lends itself to the study of other types of distributions, such as the discrete case, 
which seems not to have been discussed in the literature. Thus, in a similar fashion, we 
derive the distribution of the sum of n random variables with point mass at the integers 
in intervals of the form [—mj,m,j], in which each rrij is a positive integer. The formula 
in the discrete case is somewhat more complicated than the corresponding formula in the 
continuous case; nevertheless, they are clearly closely related, and there is a certain charm 
and elegance to both. Of course, the same results may be obtained using the standard 
transformation methods. 
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2. The Continuous Case 



Fix a positive integer n, and let a — (ai, 


a 2 , ■ 


. . , a n ) and c— (ci, c 2 , . . . , c n ) be vectors of 


real numbers with each aj > 0. For each j 


= 1, 


2, ... ,n, we consider a random variable Xj 


uniformly distributed on the closed interval [cj - 


-a_j, Cj + aj]. The step function Xj '■ R - ► R 


defined for real x by 








1 


if a; — Cj| < Oj, 


(2.1) 2a jXj (x) = * 


i 

2 


if 1 37 C^' | — ? 







if X — Cj > CLj 



represents the density of the random variable Xj for each j = 1,2, ... , n. (When em- 
ploying techniques from Fourier theory, it is convenient to define the densities at jump 
discontinuities so that the equation Xj( x +) + Xj( x ~ ) — %Xj( x ) is satisfied for all real x.) 
The corresponding characteristic function (Fourier transform) is given by 

Xj(t) := ^- [ ' 3 e itx dx = e iCjt sinc(a^), t E R, 

where sinc(x) := x~ x sin a; if x ^ 0, and sinc(0) := 1. For real x, the density of the sum 
Y7j=i Xj is given by the ra-fold convolution f n (x) :— (xi * X2 * • • • * Xn)(^)- Thus, 

/oo /*oo /"OO 

- V2) I X2(V2 - 2/3) • • • / Xn-l(Vn-l - J/n)Xn(j/n) djfc • • ■ dj/n- 
-oo J —oo J —oo 

In particular, if each of the n intervals is centered at 0, and we write x\ = x — 1/2, x n = y n 
and Xj = i)j — yj + i for 1 < j < n, then the conditions on the variables 

x ii x 2i ■ ■ ■ 1 x n 3-re 
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that each \xj\ < aj and Y^=i x j = x - Thus, f n {x) is simply the volume (in the sense of 
Lebesgue measure) of the (n — l)-dimensional region 

{(x 1 ,x 2 , ■ ■ ■ , x n -i) G R™" 1 : \x — YTjZi x j\ < a n and \xj\ < aj for 1 < j < n} 

divided by the volume YYj=i °f the ^-dimensional hyperbox 



(2.2) 



{(x 1 ,x 2 , ■ ■ ■ ,x n ) e R n : \xj\ < dj for j = 1, 2, . . . , n}. 



Despite the utility of these representations, it is desirable to have an explicit formula for 
/„. In this vein, we have the following 

Theorem 1. The density of the sum of n independent random variables, uniformly dis- 
tributed in the intervals [cj — aj, Cj + af\ for j = 1,2, ... ,n, is given by 



fn{x) 



n-1 



(2.3) 



ee{-i,i}" i =1 



x sign ( x + Y^( £ 3 a 3 ~ c i) ) \\ e 



3=1 



3=1 



[n 



i)!2- +i n^ 



J'=l 



in which the sum is over all 2™ vectors of signs 



e — (s-i, e 2 , ■ ■ ■ , e n ) e {— 1, l} n i.e. each Ej = ±1 



and 



sign(y) := < 



1 ify>0, 
ify = 0, 
-1 ify<0. 
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Proof. Since the random variables are assumed to be independent, the characteristic func- 
tion of the distribution of the sum is the product of the characteristic functions of their 
distributions: 

n n 

(2.4) Ut) = n xm = n ettcj sinc M- 

Mitra (1971, p. 195) remarks that "It is difficult to obtain an inverse Fourier transform 
of this product in a neat form." Indeed, his approach is to expand the product of sines 
into a power series using generalized Bernoulli polynomials. However, we shall see that 
the inverse Fourier transform of (|2.4J) has an elegant representation involving a sum over 
the vertices of the hyperbox (J2.2|) . 

Since for all real x, f n (x) = |/ n (x+) + |/ n (x— ) by continuity of f n for n > 1, and by 
definition of xi when n — 1, Fourier inversion gives 

-i /*oo -i roc n 

Ux) = — / e- itx f n (t) dt = — e-*» J] smc{ aj t) dt, 

where y := x — YTj=i c j- Since sine is an even function, making the change of variable 
£ i — > — t yields 



(2.5) f n (x) = — / e** TTsinc( aj t) dt. 

2tt J.^ ^ jl 



i=i 

It remains only to evaluate the integral f)2.5j) . Related integrals are studied in Borwein 
and Borwein (2001) using a version of the Parseval/Plancherel formula, and by expanding 
the product of sines into a sum of cosines. Our approach is somewhat more direct. We 
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first express the sine functions using complex exponentials, so that 

(2.6) fn (x) =^[ Yi ) (n a ?) / _ ^ n - e_ " j ) dt - 

For each of the 2 n vectors of signs e = (si, £2, ■ ■ ■ , s n ) G { — 1, 1}", let 



p £ := Y\_ £ j> anc ^ e-a = V^e^ay. 
By carefully expanding the product of exponentials in ()2.(ij) . we find that 



(2.7) JJ(e^_ e -^) = ^ Pe exp(ite • 3). 

i=i re{-i,i} n 

It follows that 

(2.8) f n (x) = — - Ml a 7M E ^ PV " / r " ex P^^ + r - d *- 

^ V^V V J=1 / e - e{ _ lil} n J -CO 

Although each of the individual integrals in (J2.8)) is divergent, the singularities must 
cancel because (|2.6|) converges. Therefore, the required finite integral (|2.6j) is equal to its 
principal value, and hence by linearity is given by ([2.8)1 . 

In view of the fact that (j2.7J) is entire with a zero of order n at t = 0, we may inte- 
grate (j2.8J) by parts n — 1 times and thereby obtain 

n / n \ ■ri—X 

/oo 
r 1 exp(it(y + e- a)) dt. 
-co 



12.9) 
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But for any real number b, we have 

/oo 
t -1 exp(itfe) dt 
■oo 

= tim^jy t~ l exp(zifr) dt + J t' 1 exp(itb) dtj 

f /*oo / , oo 

= lim <j/ t^ 1 exp(itb) dt — J t^ 1 exp(-itb) dt 

POO 

= 2i r l sin(tb)dt 
Jo 

= iir sign(fr). 

Applying this latter result to (|2.9|) yields ()2.3|) and completes the proof of Theorem [T] □ 

In some applications, it may be easier to work with powers of expressions involving the 
maximum function y + := max(y, 0) as opposed to the sign functions in (j2.3jl . To this end, 
we make the following 



Definition 1. Let r : R — > R be given by 

1, if x > 0, 

(2-10) r(x)={ I, if 3 = 0, 

0, ifx<0, 

and for y real and n a positive integer, let i/™ -1 := y n ~ l T(y). Note that y° = r(y) and 
= (max(y, 0)) n for n > 0. 



Then we have the following corollary to Theorem 
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Corollary 1. Let f n be as in Theorem^ Then 



(2.11) f n (x) 



(n \ n — 1 n 1 I T n ■ 

1- !-Ji" 3 1 ; : J/ L 3=1 ■ 



•ee{-i,i} n i= l + i =1 

Proof. Note that sign(x) = 2t(x) — 1 holds for all real x. Hence, substituting the r 
function for the sign function in ([2.3)1 . we see that it suffices to prove the identity 

(n \ n—1 n 

•'• • >>/',-':,)) n ^ 
ttl -i,i r j=i ' i=i 

Since e a ^ - e~ a ^ = 2a jt + 0(t 2 ) as t -> 0, (|2~T2l follows easily on comparing coefficients 
of r -1 in 

( x + ^2 W - * ) t \ n ei = e** n e_cj< - e ~ aj4 ) • 

Alternatively, note that if x > 52y=i( c i+ a i)) then / n (x) = by definition: since each Xj is 
u[cj — a,j, Cj + aj], the sum J^" =1 Xj must fall within the interval E™ =1 (cj — %), X]j=i( c i + 
a,,)]. But, if x > YTj=i( c j + a i)) then we can drop the subscripted "+" from (|2.11j) 
since x + Y^j=i( £ j a j ~ c j) — for each e { — 1, l} n . It follows that (|2.12j) holds for all 
x > X]j=i( c i + a i)- Since the left hand side of (|2.12|) is a polynomial in x which vanishes 
for all sufficiently large values of x, it must in fact vanish for all real x by the identity 
theorem. □ 

Example 1. When n — 1, formulae ()2.3|) and (|2.11|) give the central difference representa- 
tions 

sign(x — Ci + ai) — sign(x — c\ — ai) r(x — C\ + ai) — r(x — c\ — a x ) 



XiO) 



Aai 2a\ 
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respectively. Both are equivalent to the definition (j2.1j) with j = 1. 
Example 2. When n = 2, we know that 

/oo 
xi(x-y)x2{y) dy. 
-oo 

If ci = C2 = 0, the convolution reduces to 

1 y**i(*+«i.M) m in(x + ai, a 2 ) - maxfi - a u -a 2 ) 
/2 W = "j / = 1 

J max(i-ai , —02) 

4«ia 2 



so that, in particular, 
(2-13) /2 ( ) = i-^ 



1 f°° sin(ait) sin(a 2 t) min(ai,a 2 ) 



ait a 2 t 2a x a 2 

On the other hand, in light of the fact that ysign(y) — \y\ for y real, formula ()2.3|) gives 

f fn\ — \ ai + a2 \ ~ \ ai — fl 2| — I — + 0-2! + I — — 0-2! 

MU) - 

oaia 2 

2(ai + a 2 ) — 2|ai — a 2 | 
8aia 2 

min(ai, a 2 ) 
2aia 2 ' 

in agreement with ()2.13|) . Alternatively, since yr(y) = y + = max(y, 0), formula ()2.3|) gives 
max(a! + a 2 , 0) — max(ai — a 2 , 0) — max(— a\ + a 2 , 0) + max(— a\ — a 2 , 0) 



/ 2 (0) 



4aia 2 

a\ + a 2 — max(ai — a 2 , 0) — max(a 2 — a±, 0) 



4aia 2 



min(ai, a 2 J 
2aia 2 
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Example 3. If the random variables comprising the sum are uniformly distributed in a 
common interval centered at — say [—a, a] with a > — then formula (j2.11j) gives 



fn{x) 



(n-l)!(2o)» ^ , V~ ' 



J2 (x + aJ2 



n \ n-1 n 



n 

i=i 



Sj. 



If A; of £x; e 2i ■ ■ ■ , £n are negative and the remaining n — are positive, then summing over 
k yields 



fn(x) 



[n 



2k)a 



n-l 

+ ' 



in agreement with Feller (1966). 



Example 4. If for each j = 1, 2, . . . , n we set Cj — dj and then replace aj by dj/2, then Xj 
will be uniformly distributed in [0, aj] and will have density Xj now given by 



(2.14) 



1, if < x < a 



'3i 



|, if x = aj or if x = 0, 



0, if x > aj or if x < 0. 



Formula (J2.11|) of Corollary Q now gives 



fn(x) 



E *-E 



1-Ej 



£G{-l,l} n V J = 1 



rb / («-!)' ii' 



E (-i: 

se{o,i}™ 



Ss / -» -An— 1 

(a; — s • a) . 



+ j=i 

n 

(n-l)! J] a, 



where the sum is now over all 2 n vectors s = (sx> ^2, • • • , s n ) in which each component 
takes the value or 1, Ss denotes the sum of the components Si + S2 + ■ — h s n and s- a 
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denotes the dot product siOi + s 2 a 2 + ■ • ■ + s n a n . If we now break up the sum according 
to the number of non-zero components in the vector s, we find that 



fn{x) 



(n \ Ti— 1 n 

■ / 1 yj •■■ /,. • • k=l 



[n — 



3=1 



which is equivalent to the formula of Olds (1952, p. 282 (1)) when n > 1. When n — 1, 
the two formula? differ at a; = and at x — a% because Olds uses the density which is 
1/ai for < x < ai and otherwise, in contrast to our more symmetrical density (j2.14|) . 

3. The Discrete Case 



Here, we fix n positive integers mi, rri2, ■ ■ ■ , m n . For each j = 1,2, ... ,n, we now 
consider a random variable Xj uniformly distributed on the set of (2rrij + 1) integers 
contained in the closed interval [— mj,mj], i.e. the set {— rrij, 1 — rrij, . . . , rrij — 1, rrij}. The 
mass function of Xj is the rational-valued function of an integer variable given by 



(3.1) 



(2m j + l)xj{p) ■-- 



1 if \p\ < rrij, 
if \p\ > rrij. 



We seek a formula analogous to ()2.3|) for the probability mass function of the sum X^=i -^-j 5 
namely the n-fold convolution 



(3.2) 



9n(p) ■= (Xl*X2*---*Xn)(p) = Y 11^'^^' 

fcl+faH hk n =P 3=1 
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where the sum is over all integers ki, k 2 , ■ ■ ■ , k n such that ki + k 2 + ■ ■ ■ + k n = p. Al- 
though (|3.2|) is a formula of sorts, the various conditions on the summation indices make 
it inconvenient to apply. For example, when n = 3, let Uk = Uk{p) '■ = min(m 2 ,p — k + rriz) 
and Vk = Vk{p) '■= max(— m 2 ,p — k — m 3 ). Then 



where the sum is over all integers k for which Uk < Vk and \k\ < m\. In general, one 
needs to consider cases which depend on the size of p in relation to various signed sums of 
subsets of the parameters mi, . . . , m n . The number of cases to be delineated increases 
exponentially with n. Thus, the situation becomes rapidly unwieldy. Fortunately, there 
is alternative approach provided by Fourier theory. With the convolution g n : Z — >• Z 
defined as above, we have 

Theorem 2. For all integers p, 




3 



u k <i> k 




n 



n 



(3.3) 




where 



n 



M := H(2 mj + l)-\ 
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and the rational numbers are the coefficients in the Laurent series expansion 

/ i \ n 00 



SIM , 

' k=0 



Explicitly, 
(3.5) 



Proof. The corresponding characteristic functions for j = 1, 2, . . . , n are now given by 

f sm((m ; + / 1 / 2)t) , ift*2*Z, 

(2m j + l)x j (t)= Sm(t/ } 

l fc l^ m i I 2 mj + 1, if t G 2ttZ, 

which we recognize as the familiar Dirichlet kernel — see eg. Korner (1988, p. 68). Since 
for t ^ 27rZ, we have 

x tt ~ / x , ,tt sin((T7i 7 - + l/2)t) 

(Xl*X2*---*Xn) W = II^) = M II sin(t/2) » 

j=l j=l 

it follows by orthogonality of the exponential that 



27V Jo J=i sin(t/2) tt Jo ^ sinx 



We remark in passing that the factors sin((2mj+l)x) / sin x in this latter representation are 
simply the Chebyshev polynomials C^m^ (cos x) of the second kind — see eg. Abramowitz 
and Stegun (1972, p. 766). The partial-fraction expansion 

(n-l)/2 00 / r_]\ r \ n-2k 



< 3 ' 6 > - E 4? E 



sin x 7 \ x + r7r 

fc=0 r=— 00 
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is a modified version of the formula in Schwatt (1924, pp. 209-210), and yields 

(n-l)/2 oo w , /_i\r \ n—2k n 

(3.7) 9n (p)= E ^ E - / e2ipx (hh) U^ 2m i+ 1 » dx - 

k=0 r=-oo J ° ^ ' j=l 

When n — 2k > 1, the bilateral series 



r=— oo 



X + T7T / 

converges absolutely, and the interchange of summation and integration is easily justified 
using either Lebesgue's dominated convergence theorem or Fubini's theorem with both 
Lebesgue and counting measure. If n — 2k = 1, absolute convergence can be recovered by 
recasting the bilateral series in the form 



z — ' X -\- T1X x z — ' \x A — r z 7T / 

r=— oo r=l s / 



and the justification proceeds as in the previous case. 

We shall see that the inner sum of integrals in (|3.7|) can be expressed as a single integral 
over the whole real line. To this end, we compute 

oo n7r / / i\r \ n—2k n 



=-oo JU \ 1 / j =1 

E / fk~n e 2ip(t-m) -Q gin ((2m . + _ r7r )) ^ 

=-oo 71 J ™ j=l 

/ t 2k - n e 2ipt JJ(-l) (2 ^ +1)r sin((2m j + l)t)rft 

=-oo 71 J™ 3=1 

1 r°° n 

- I t 2k ~ n e 2ipt TT sin((2m i + l)t) di. 

n J-oo 7" i 



16 SUM OF NON-IDENTICAL UNIFORM RANDOM VARIABLES 

This latter integral can be evaluated just as we evaluated the integral (|2.5jl . After ex- 
panding the product of sines as a sum over the constituent exponentials and integrating 
by parts n — 2k — 1 times, one finds that 

1 poo n 

(3.8) - / t 2k ~ n e 2ipt Y[ sin((2m i + l)t) dt 

nn-2 -!)■ E ^+B^ + lfe 

2p + ^{2m 3 + l)eA JJe,, 

.7 = 1 ' .7=1 



X Sl£ 



Substituting ()3.8|) into ()3.7|) and interchanging the order of summation completes the 
proof of Theorem |21 □ 

Remark. The equations ()3.4|) . ()3.5|) . ()3.6|) can also be obtained using Jordan's (1979, 
§74, p. 216) general formula for the higher derivatives of a power of a reciprocal function 
and then applying Mittag-Leffler's theorem. 

As in the continuous case, we can replace the sign function in Theorem El with the r 
function of Definition ^ Thus we obtain 



Corollary 2. Let g n , M, and be as in Theorem® Then, for all positive integers 
and integer p, 

(n-l)/2 k ( n ) / n \ n-2fc-l n 



n 



9.(p) - £ e ( , : I % e (^Ep'^fe) it 

k=0 V 7 ' £ 6 {-l,l}n ^ j=l 7 + j = l 

where y^T 2 ^ 1 = y n ~ 2k ~ 1 r{y) as in Definition^ 
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Proof. Making the substitution sign(y) = 2r(y) — 1 in Theorem |21 and noting that the 
coefficient of t n ~ 2k ~ l in 



exp | Up + J2( 2m i + ^V) = e " pt f[ (e (2mj+1)t - e 



(2m i +l)t N i 



ee{-l,l} T 

vanishes for < k < {n — l)/2, we obtain the desired result. □ 

Example 5. When n = 1, formulae (|3.3j) and (|3.9|) give the representations 

(2mi + l)xi(p) = |sign(p + m x + \) - §sign(jo - m 1 - \) 
= r{p + mi + \) - r(p - mi - |), 
respectively. Both are equivalent to the definition (|3.1J) with j = 1. 

Example 6. When n = 2, we have M := (2mi + l) _1 (2m 2 + l) -1 , and 



92(p) = Xi * X2(p) = 5^ Xl(^)X2(j)- 
Since this is simply the coefficient of t p in the product 



M yj t- yj f, 

|fc|<mi | J7 | < 

letting := min(mi,p + m 2 ) and t> (p) := max(— m 1; p — m 2 ), we have 



02 (p) = M X 



— f (p) + 1 if -u(p) > f (p), 
if -u(p) < v{p). 
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On the other hand formula ()3.3|) gives the elegant representation 

92(p) = \M (\p + mi + m 2 + 1| - \p + mi - m 2 \ - \p - m 1 + m 2 \ 
+ \p -m x -m 2 - 1|) , 

in which we have used the relation ysign(y) = \y\ for y real. 
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